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ABSTRACT 

We present a homogeneous compilation of HI spectral parameters extracted from global 21 cm 
line spectra for some 9000 galaxies in the local universe (heliocentric velocity —200 < V© < 28, 000 
km s _1 ) obtained with a variety of large single dish radio telescopes but reanalyzed using a single 
set of parameter extraction algorithms. Corrections to the observed HI line flux for source extent 
and pointing offsets and to the HI line widths for instrumental broadening and smoothing are applied 
according to model estimates to produce a homogenous catalog of derived properties with quantitative 
error estimates. Where the redshift is available from optical studies, we also provide flux measurements 
for an additional 156 galaxies classified as marginal HI detections and rms noise limits for 494 galaxies 
classified as nondetections. Given the diverse nature of the observing programs contributing to it, 
the characteristics of the combined dataset are heterogeneous, and as such, the compilation is neither 
integrated HI line flux nor peak flux limited. However, because of the large statistical base and 
homogenous reprocessing, the spectra and spectral parameters of galaxies in this optically targeted 
sample can be used to complement data obtained at other wavelengths to characterize the properties 
of galaxies in the local universe and to explore the large scale structures in which they reside. 
Subject headings: galaxies: distances and redshifts — galaxies: fundamental parameters — radio lines: 
galaxies — techniques: spectroscopic — astronomical data bases: miscellaneous 



1. INTRODUCTION 

Neutral hydrogen constitutes a significant component 
of the interstellar medium of disk galaxies. The study 
of its abundance and kinematics, as measured by the 
21 cm emission line, has seen applications to a variety 
of astrophysical problems, ranging from studies of the 
large scale distribution of galaxies to secular evolution in 
clusters. HI line emission carries information about the 
distribution of neutral gas, and through tracing of its 
velocity field, of the total mass in a galaxy. Even in the 
absence of any spatial information about the distribution 
of the gas, the global HI line profile, contained within 
the beam of a single dish telescope or integrated over 
the entire HI distribution where resolved, provides three 
key indicators: (1) the observed redshift of the system 
as a whole; (2) the observed HI line flux, which, in the 
optically thin case, can be related to the HI mass; and (3) 
the observed HI line full width, which, when corrected to 
face-on viewing, is related to the total rotational velocity 
of the HI gas. These quantities have been used for diverse 
applications such as the tracing of large scale structure by 
gas-rich systems, the comparative HI content of galaxies 
in differing environments and the derivation of secondary 
distances using the Tully-Fisher method (Tully & Fisher 
1977). 

HI line systemic velocities, fluxes and characteristic ve- 
locity widths have been recorded for more than 20,000 
galaxies and presented in large published compilations 
such as that of Huchtmeier & Richter (1991) and the 
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Third Reference Catalog of Bright Galaxies (RC3, de 
Vaucouleurs et al. 1991). Until recently, the vast ma- 
jority of extragalactic HI line spectral parameters were 
associated with observational targets selected from op- 
tical catalogs. Only recently have large-area HI blind 
surveys been undertaken, notably the Arecibo HI Strip 
Survey (AHISS; Zwaan et al. 1997), the Arecibo Dual 
Beam Survey (ADBS; Rosenberg & Schneider 2000), the 
HI Jodrell All-Sky Survey (HIJASS; Lang et al. 2003) 
and the HI Parkes All-Sky Survey (HIPASS; Barnes et 
al. 2001; Koribalski et al. 2004; Meyer et al. 2004). Most 
recently, the blind HI survey conducted with the Parkes 
Telescope multifeed array (HIPASS) has catalogued 4315 
extragalactic HI line sources (HIPASS catalog; Meyer et 
al. 2004) south of Dec. < +2° identified purely on the 
peak flux of their HI line emission profiles. Perhaps sur- 
prisingly, extremely few systems detected in the HI blind 
surveys have no optical counterpart (Kilborn et al. 2000; 
Rosenberg & Schneider 2002), but understanding the dif- 
ference between optical- and Hl-selected samples is crit- 
ical to determining the true census of gas-bearing objects 
in the nearby universe and the relationship of present day 
disks to the damped Lyman a absorbers seen in QSO 
spectra. 

With the exception of the HIPASS dataset which has 
been processed uniformly and can be accessed digitally 
through a public website (http://hipass.aus-vo.org/), a 
significant disadvantage of past compilations arises from 
heterogenous processing and parameter extraction algo- 
rithms, as well as the lack of availability of the digital 
spectra themselves. In many cases, error or noise esti- 
mates are lacking and corrections for instrumental effects 
not applied or derivable. Because of our own on-going 
work related to the applications of HI spectral data in 
correlative and secondary studies, we have developed and 
maintained a digital archive of HI line spectra for some 
9000 optically-selected galaxies observed by us and our 
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numerous collaborators using various telescopes over the 
last 20+ years. Two of the telescopes used, the 91 m 
and 42 m telescopes of the National Radio Astronomy 
Observatory 3 at Green Bank, are no longer operational 
(in fact, the 91 m telescope no longer exists). While 
HI parameters for most of these datasets have been pre- 
sented previously, changing scientific objectives and com- 
puter capabilities have led to advances in the robustness 
of parameter extraction algorithms. In order to estab- 
lish a well-characterized and permanent digital archive 
of these global HI profiles, we have reanalyzed the global 
HI line profiles using a single set of processing and pa- 
rameter estimation algorithms. In this paper, we present 
a homogeneous compilation of the extracted HI line pa- 
rameters and provide quantitative corrections and error 
estimates. These data are being used in a variety of 
ongoing studies, including a robust estimation of the HI 
mass function (Springob, Haynes, & Giovanelli 2005) and 
studies of the local peculiar velocity field. 

In Section 2, we review the observational circumstances 
of the archival dataset as well as present new HI observa- 
tions undertaken with the Arecibo 305 meter telescope. 
Section 3 discusses the process of extracting useful pa- 
rameters from the global HI line profiles and the correc- 
tions required to derive physically meaningful quantities. 
The archive is presented in Section 4, followed by discus- 
sion of the characteristics of the data set, its complete- 
ness and its limitations in Section 5. A brief summary 
concludes in Section 6. 

2. OBSERVATIONS 

The majority of the HI line datasets included in the 
current compilation have been discussed in previous pub- 
lications. Here we review the general characteristics of 
the telescopes employed to conduct the observations, 
summarized in Table 1, and also present the results of 
new observations for a sample of 145 obtained with the 
upgraded Arecibo telescope. 

2.1. Archival Datasets 

Since the early 1980's, we have attempted to main- 
tain a digital archive of the extragalactic HI global pro- 
files we and our collaborators have observed at a number 
of large single-dish radiotclcscopes, namely the 305 m 
Arecibo telescope of the National Astronomy and Iono- 
sphere Center, the late 91 m and 42 m Green Bank tele- 
scopes of the National Radio Astronomy Observatory, 
the Nangay radio telescope of the Observatory of Paris 
and the Effelsberg 100 m telescope of the Max Planck 
Institut fur Radioastronomic. The basic parameters of 
those telescopes are summarized in Table 1, along with a 
compilation of the principal references to the published 
works which include discussions of the individual observ- 
ing programs, observational setups and target selection 
criteria. Observations at Arecibo employed several diff- 
ferent L-band feed systems, denoted by their designated 
names: one of the high gain dual circular polarization 
feeds ( "dual circular" ; two such similar feeds were avail- 
able, one of which was remotely tunable in frequency) 
and a low sidelobe linear polarization L-band feed ( "flat" 
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feed). During the 1990's, the Arecibo telescope was up- 
graded and a low noise, broadband secondary and ter- 
tiary reflector Gregorian optical system was installed; 
observations made with it are described in the follow- 
ing subsection. Nearly all observations were conducted 
in total power mode, with a position switching technique 
used to remove instrumental baseline effects. At each 
telescope, calibration of the noise diodes was performed 
by conducting observations of continuum sources from 
the catalog of Bridle et al. (1972), and both the flux and 
velocity scales were further checked by observations of 
a sample of strong HI line emitting galaxies. Typical 
on-source integration times were dictated by signal to 
noise requirements and varied from as little as 5 minutes 
at Arecibo to as much as 2 hours with the 42 m and 
Nangay telescopes. Final spectra are the accumulated 
weighted averages of multiple ON-OFF pairs. 

The main distinctions among the HI datasets arise 
from differences in the sensitivities achieved with the dif- 
ferent telescopes, and differences in the resolution and 
bandwidth associated with the available spectral back- 
ends. In order both to permit the analysis of instru- 
mental effects and to measure widths using the same al- 
gorithm, the accumulated and calibrated final spectra 
were exported from the respective on-line data format 
into the standard ANALYZ-GALPAC "history file" for- 
mat developed by RG at the Arecibo Observatory and 
were thus available in digital form for reprocessing using 
the algorithms described below. All accumulated spectra 
have been re-smoothed and re-baselined using an inter- 
active process that allows the user to select the degree 
of smoothing (typically Hanning, but sometimes also a 3 
channel BOXCAR in order to improve signal-to-noise, as 
noted in Table 3), choose the baseline regions and order 
(typically 3), flag the boundaries for flux integration and 
examine the results before they are stored away in the 
permanent archive. 

All spectra presented here have been reprocessed us- 
ing the same parameter extraction algorithms, producing 
new measurements which supercede those presented pre- 
viously. 

2.2. New Arecibo Observations 

In addition to the HI spectra already contained in the 
digital archive, new HI line observations were conducted 
with the Arecibo 305 meter telescope and its Gregorian 
optics system between September and November 2001. 
116 galaxies were detected, and an additional 29 galaxies 
for which we have optical heliocentric velocities yielded 
upper limits. Targets were selected from among inclined 
spirals included in our private database, referred to as the 
Arecibo General Catalog (AGC). The new observations 
preferentially targeted those objects which appeared to 
be good Tully-Fisher candidates and for which we did 
not already have HI spectral parameters, in the region 
of the Pisces-Perseus supercluster, that is 22 h < R.A. 
< 4 h , 0° < Dec. < +38°. 

The L-band narrow receiver was used in 9-level sam- 
pling mode for all observations, feeding each polariza- 
tion to four correlator boards with 1024 channels and 25 
MHz bandwidth (yielding a velocity resolution of 5.15 
km s _1 ). In most (but not all) cases, the redshift was 
not known in advance so that a search mode was em- 
ployed, staggering the four boards to cover a total of 85 
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TABLE 1 

Characteristics of Single Dish Telescopes and their Datasets 



Telescope/Feed 


Aperture 


Gain 


HPBW 


# spectra 


Original Reference 




(m) 


(KJy- 1 ) 


(') 






Arecibo "dual circular" 


305 


8.0 


3.3 


6475 


HG84,a,b,c,d,e,f,g,h,k,m,n,o,p,q,s 


Arecibo "flat" 


305 


6.0 


3.9 


20 


HHG.o.q 


Arecibo "Gregorian" 


305 


11.0 


3.5 


116 


this work 


Effelsberg 


100 


1.5 


8.8 


66 


rrj 


Green Bank 91 m 


91 


1.1 


10.0 


1090 


i,j,m,n,o 


Green Bank 42 m 


42 


0.25 


21.0 


706 


l,m,r 


Nangay 


300x34.6 


1.2 


4x20 


377 


m 



a Freudling et al. 1988 
b Freudling et al. 1992 
c Giovanelli et al. 1986 
d Giovanelli & Haynes 1989 
c Giovanclli & Haynes 1993 
f Giovanelli et al. 1997 
g Haynes & Giovanelli 1986 
h Haynes & Giovanelli 1991a 
'Haynes & Giovanelli 1991b 
j Haynes et al. 1988 
k Haynes et al. 1997 
'Haynes et al. 1998 
m Haynes et al. 1999 
"Magri et al. 1988 
°Magri 1994 
PScodeggio et al. 1995 
^van Zee et al. 1995 
r van Zee et al. 1997 
s Wegner et al. 1993 



MHz bandwidth. The first board was centered at the 
21 cm line rest frequency of 1420.40578 MHz, while the 
other three boards overlapped but with center frequen- 
cies offset at 20, 40, and 60 MHz below this frequency. 
On all nights, the system temperature was approximately 
30 K. All observations were conducted in a total power, 
position switching mode as a sequence of ON-source and 
OFF-source pairs of 4 minutes each. The final spectrum 
is the average of the combination of the successive ON- 
OFF pairs. The total number of ON/OFF observations 
made for each galaxy varied based on the signal-to-noise 
ratio, and ranged from one to four. On the first three 
nights of observations, each board was Doppler corrected 
to heliocentric velocity at the start of each ON-OFF pair. 
Beginning with the fourth night, the same Doppler cor- 
rection was applied to each of the four boards, allowing 
the spacing between correlators to remain at 20 MHz ex- 
actly. Beginning on the fifth night, we used the NAIC 
"radar blanker", which detects the San Juan airport's 
radar and suspends data-taking for brief intervals around 
the pulse arrival times, for all observations in which the 
rcdshift of the galaxy was either not known a priori or 
known to be such that the redshifted 1420 MHz HI line 
would overlap with the 1350 MHz frequency of the air- 
port radar. 

Whereas data reduction for all other Arecibo obser- 
vations presented here was performed using the Arecibo 
ANALYZ-GALPAC analysis system, the data reduction 
for these new observations was carried out using the 
AIPS++ software package 4 . The reduction algorithms 

4 The AIPSH — h (Astronomical Information Processing System is 
a product of the AIPSH — h Consortium. AIPSH — h is available for 
use under the Gnu Public License. Further information may be 



match those carried out for the earlier observations in 
ANALYZ-GALPAC (and discussed in Haynes et al. 1999 
and references therein) exactly, with one exception. We 
have used the 'setfreqmed' and 'settimemed' functions 
within the AIPS++ 'autoflag' package to eliminate in- 
dividual time-frequency pixels found to be corrupted by 
radio frequency interference (RFI). The RFI identifica- 
tion algorithm flags pixels that deviate by more than 
10 times the median deviation from a sliding median of 
width 10 pixels in both the time and frequency dimen- 
sions. The width of each pixel in the time domain is 1 
second. 

3. HI 21 CM LINE FLUXES, SYSTEMIC VELOCITIES AND 
LINE WIDTHS 

The parameters extracted from global 21 cm line pro- 
files obtained with single dish telescope require correc- 
tion before physical quantities can be derived from them. 
Here the process for conversion from observed to cor- 
rected parameters is described in some detail. 

3.1. Integrated Line Fluxes 

Modern radio telescopes can be continuum flux cali- 
brated to a small fraction of their system noise, but as 
discussed by van Zee et al. (1997), the accuracy with 
which HI line fluxes can be measured is much less pre- 
cise. Beyond the continuum flux calibration which can be 
achieved at the better than 2% level, the accuracy with 
which line fluxes can be measured depends both on base- 
line stability and on corrections that must be applied to 
the observed values. The HI line fluxes S b s — J S(V) dV 
presented here have been measured on the smoothed and 

obtained from http://aips2.nrao.edu 
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baseline subtracted profiles by an interactive process in 
which the user marks the range of correlator channels 
over which the flux is measured by integrating the emis- 
sion within these boundaries. The observed line flux S b s 
must then be corrected for (f) beam attenuation, (2) 
pointing offsets, and (3) HI self absorption. We discuss 
each correction separately. 

Since the HI distribution in a galaxy is often partially 
resolved by the telescope beam, the HI line flux detected 
by a given telescope underestimates the true flux by an 
amount that depends on how much the source HI distri- 
bution fills, or overfills, the beam. Since the HI distribu- 
tion is generally not known, an estimate of the beam at- 
tenuation correction is based on the adoption of a model 
HI distribution and an average telescope beam power 
pattern. The power patterns of the Arecibo, 42 m, 91 m 
and 100 m telescopes are assumed to be Gaussians with 
half power beam width (HPBW) as given in Table 1. 
For the Nangay telescope, the beam width is assumed to 
be a two-dimensional Gaussian with HPBW of 4' in the 
R.A. direction and 20' in the Dec. direction. The HI 
distribution is modeled as the sum of two Gaussians, one 
of negative amplitude to produce a central hole (Shostak 
1978; Hewitt, Haynes & Giovanelli 1983, hereafter HHG). 
Following Section IV of HHG, the HI surface brightness 
distribution <r H (r) is assumed to be characterized by the 
same axial ratio and to lie along the same position an- 
gle as the optical galaxy and is scaled such that -D70, 
the diameter encompassing 70% of the HI mass, equals 
1.3 a, where a is the optical major axis, in arcminutes, 
taken from either the Uppsala General Catalog (Nilson 
1973, hereafter UGC) or the ESO-Uppsala Survey of the 
ESO(B) Atlas (Lauberts 1982, hereafter ESO), or esti- 
mated by us on the POSS-I 5 . Following HHG, the frac- 
tion /1 of the source flux that is detected by the telescope 
beam is given by 



h 



^=1,2 



TT2 



.7=1,2 3 



(i) 



Like HHG, we assume a double Gaussian an of ampli- 
tude 02 = — 0.6ai and relative extent 62 = 0.236*i. Note 
that 9b above is not the HPBW but rather the beam 
extent at which the power pattern falls by e , assum- 
ing a Gaussian response. The inclination i of the galaxy 
is derived from the observed optical axial ratio b/a and 
an assumed intrinsic axial ratio that varies from 0.10 for 
the latest spirals to 0.23 for the earliest following Section 
IV.e of Haynes & Giovanelli (1984; HG84). 

The correction factor ci that accounts for beam atten- 
uation is the inverse of this fraction, ci — 1/fi > 1 and 
approaches 1 when the HI distribution is significantly 
smaller than the telescope beam. Because the HI extent 
of Hl-normal spirals is significantly larger than their op- 
tical dimension, the application of this correction is very 
important for most HI observations conducted at Arecibo 
with its relatively small HPBW (<~ 3.5'). For example, 
face-on galaxies with optical sizes equal to about half the 
HPBW (i.e., a <~ 1.7') have corrections of about 25%; 

5 The National Geographic Society - Palomar Observatory Sky 
Atlas (POSS-I) was made by the California Institute of Technology 
with grants from the National Geographic Society. 



a similar edge-on HI disk requires a correction half as 
large. For an object inclined by ~ 45° and an optical 
extent equal to the HPBW, the correction amounts to 
^60%. Because the HI disk can normally be traced well 
beyond the optical extent, the observed HI line flux in 
face-on systems as small as 0.8' require corrections of 5% 
or more. The other telescopes that have been employed 
have substantially wider beams, and there are few galax- 
ies observed with them for which this correction is sig- 
nificant. 

Additionally, if the source HI distribution is not cen- 
tered in the telescope beam, S b s will be less than it 
would be in the absence of such pointing offset. Depar- 
tures from centering arise both from the intrinsic accu- 
racy of a radio telescope to center on the input coordi- 
nates and from errors in the target coordinates relative 
to the true source position, and they are important only 
when such offsets subtend a substantial fraction of the 
beam extent. For older observations conducted with the 
Arecibo line feeds, the rms pointing accuracy of the tele- 
scope system was ~ 15" in both R.A. and Dec. The 
precise amplitude of this error is not known but has been 
assumed to be random and of order 5% (HG84). There- 
fore, all HI lines fluxes derived from spectra obtained at 
Arecibo prior to 1993 have been multipled by a constant 
factor of c 2 = 1.05 to account for this systematic bias. 
This correction is not applied to other datasets. 

Similarly, older datasets are also affected by the inaccu- 
racy of the coordinates input to position the telesope on 
source at the time the observations were conducted. In 
such cases, the flux is also underestimated by an amount 
which depends on the ratio of the angular displacement 
of the center of the telecope beam from the true source 
location to the beam extent and is important for some of 
the older Arecibo line feed data. For spectra in our dig- 
ital archive, a measure of the amount by which the flux 
has been underestimated because of coordinate inaccu- 
racy can be calculated because the input coordinates are 
recorded in the digital spectra. To estimate this correc- 
tion factor, we have performed a set of numerical sim- 
ulations mimicking observations offset from the Arecibo 
line feed beam center, by convolving the beam power pat- 
tern with template galaxy HI distributions following the 
method developed in HHG. These simulations confirm 
that the measured flux in the simulations is relatively in- 
sensitive to the position angle of the galaxy (assumed to 
follow the optical image), but quite sensitive to the angu- 
lar displacement of the beam center from the center of the 
HI disk. Following again HHG's result that D70 = 1.3a, 
we relate the angular displacement 6 of the targeted po- 
sition from the true position and the optical apparent 
diameter a to the flux correction C3, as shown in Figure 
1. For each galaxy observed with the Arecibo line feeds 
for which the pointing offset exceeds 0.1', we adopt the 
flux correction associated with the closest match in both 
optical size and pointing offset shown in Figure 1. The 
figure illustrates the point that the required flux correc- 
tion can be very large even for optically small galaxies, 
if the pointing offset is large. However, there are only 
183 galaxies for which this correction boosts the flux by 
a factor of C3 =1.5-2.0, and only 62 for which the flux 
correction is more than a factor of C3 =2. 

Lastly, optical depth effects also can result in a de- 
crease of detected HI line flux if not all the photons 
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Fig. 1. — Simulated pointing offset flux correction C3 for older AO 
line feed observations for galaxies of different optical sizes. The flux 
correction is the ratio of a galaxy's actual flux to its observed flux 
given its size and the offset of the commanded telescope position 
during the observations from the optical galaxy center. The ten 
different curves represent corrections for different optical sizes. The 
top curve is for galaxies with an optical major diameter of 0.54'. 
The next curve for 1.08', and so forth, down to 5.4'. The average 
pointing offset for all Arecibo observations is ~ 0.21' so that the 
application of this correction is significant only in a relatively small 
number of cases. 



escape. In the denser regions of disks, the interstellar 
medium can be optically thick at 21 cm (Braun 1997), 
but the filling factor of opaque regions is small in the vast 
majority of disks. Following Giovanelli et al. (1994) and 
using the optical axial ratio as the measure of disk incli- 
nation, we correct for HI self-absorption by multiplying 
the measured HI line fluxes by the factor C4 = (a/6) ' 12 . 
This effect is small, with less than 30% of the HI line 
flux self absorbed in the most highly inclined systems. 
However, there are some galaxies for which this is the 
dominant correction. Because this correction is model- 
dependent, and the interested reader may wish to apply 
a different self-absorption correction, we report the self- 
absorption corrected flux separately. 

The two values we report for the corrected HI line flux 
are then S c = ciC2C-zS bs, including the corrections for 
source extent and pointing, and S^ bs = c\C2CsC4S bs, 
which also includes the correction for HI self-absorption. 
S® ba is, on average, 25% larger than the observed line 
flux. It should be noted that all of the corrections in- 
crease the value of line flux used in the calculation of 
HI masses. Comparison of values presented here with 
others in the literature should take into account similar 
treatments in order to avoid systematic differences in the 
derived HI masses which might otherwise be interpreted 
as physical differences. 

3.1.1. Integrated Line Flux Errors 

Several different sources of error affect the measure- 
ment of integrated HI line fluxes. The first is the uncer- 
tainty in the absolute flux calibration commonly achieved 
by firing a noise diode at regular intervals (van Zee et 
al. 1997) and understanding well the telescope gain and 



its variation. Because modern telescope systems are sig- 
nificantly more stable than older ones, we follow van Zee 
et al. (1997) in estimating that the uncertainty intro- 
duced by the noise diode calibration is 2% of the observed 
integrated flux for those observations made with the 42 
m Green Bank telescope and for those made with the 
Arecibo telescope after the installation of its Gregorian 
optical system. For all other observations, we estimate 
the uncertainty introduced by the noise diode calibration 
to be 10% of the observed integrated flux. Further, we 
estimate that the uncertainty introduced by the correc- 
tion coefficients is (C1C2C3C4 — l)S b s /S. 

The integrated flux error contribution from statisti- 
cal uncertainty has been discussed by both Fouque et 
al. (1990b) and Schneider et al. (1990), both of whom de- 
rive analytical expressions for the statistical uncertainty 
Unlike the Fouque et al. (1990b) formula, however, the 
Schneider et al. (1990) formula does not include any as- 
sumptions regarding the shape of the line profile, but 
does include the contribution to the total error stem- 
ming from uncertainties in the baseline fit. We therefore 
adopt the Schneider et al. (1990) formula, with one mod- 
ification. The formula includes a dependence on Wp2o, 
the line width measured at 20% of the peak flux den- 
sity. We have our own prefered width measurement al- 
gorithm which is explained in Section 3.2. Because we 
find that the value of Wp2o/Wf5o (where Wf5o is the 
width as measured by our favored algorithm) is, on aver- 
age, ~ 1.2, we replace the 1.2Wp2o term in Schneider et 
al. (1990) with IAWfzo- e s g at , the statistical component 
of the integrated line flux error, is then given by 

eg at = 2 rmsy/lAW F50 AV (2) 

(where rms is the r.m.s. figure, measured in the signal- 
free part of the spectral baseline and AV is the velocity 
resolution of the spectrum). Our estimate of the total 
integrated line flux error, e s , is then the sum in quadra- 
ture of the error contributions from statistical noise and 
baseline uncertainties, noise diode calibration, and cor- 
rection coefficients. For the vast majority of the galaxies 
in the archive, this uncertainty is 10-15% of the corrected 
integrated flux Sf s . 

3.2. Systemic Velocities and Velocity Widths 

The systemic velocity of the HI line profile is usually 
derived as the midpoint of the segment that connects 
two points on opposite edges of the emission profile at 
a selected intensity level, usually some fraction of the 
mean or peak flux. The same algorithm is also used to 
derive the HI line width, and its extraction often dic- 
tates the choice of measurement algorithm. Over the 
years, numerous algorithms have been applied to derive 
width measures that both are robust in the presence 
of noise and most closely reflect the disk rotational ve- 
locity (Lewis 1983; Schneider et al. 1986; Chengalur et 
al. 1993). Algorithms applied originally to HI spectra 
contained in earlier datasets were optimized to produce 
consistent measurements in noisy spectra (e.g., Bicay & 
Giovanelli 1986) whereas more recent studies added em- 
phasis on sampling the rotational velocity (Lavezzi & 
Dickey 1997). 

The algorithm adopted to obtain HI line velocities and 
line widths here is very similar to that discussed by Chen- 
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Fig. 2. — Examples of 21 cm line profiles, illustrating the systemic 
velocity and velocity width measuring algorithm. The spectral res- 
olution and unit, in abscissa ("channel number") is typically 2—11 
km s~ 1 . In panel (a), an average quality profile is illustrated, with 
horns near channel numbers 196 and 340 respectively. Straight 
lines are fit to the outer slopes of the profiles (circled points were 
used in the fit), and the channels corresponding to the flux level 
equal to 50% of the peak level, on the fit, are flagged. Velocities Vi 
and V u defined in the text correspond, in panel (a), to the center of 
channels 185 and 347. Velocity width W50 (in spectrograph chan- 
nel units) and systemic velocity are indicated. In panel (b), a low 
signal-to-noise profile is shown, illustrating some of the difficulties 
that can be encountered by the algorithm in identifying the horns 
and fitting the slopes. In panel (c), an asymmetric profile is shown, 
for which the identification of one of the horns is also arduous, due 
to the structural asymmetry of the line profile. 



galur et al. (1993) and has been previously discussed in 
Haynes et al. (1999). Figure 2 illustrates the issues in- 
volved in systemic velocity and width extraction from 
global HI line profiles through simulations of represen- 
tative cases. After identification of the flux level, f p , 
of each of the spectral horns, the rising side of the pro- 
file is fit by a polynomial between the levels of 15% and 
85% of f p — rms. The velocity flagged for that side of 
the profile V\ is that for which the polynomial fit has a 
flux of 50% of the value of f p — rms. This process is 
repeated for the other side of the profile to flag a ve- 
locity V u at the corresponding level on the other horn. 
The adopted velocity width, hereafter designated Wf5o, 
is defined as the difference between these two velocities 
Wf5o =V u — Vi, while the system velocity is their aver- 
age V21 = (V u + Vi)/2. In the vast majority of cases, the 
polynomial fits arc straight lines. In rare cases, however, 
higher order fits are required and can be selected during 
the analysis stage. 

This procedure has a major advantage over width algo- 
rithms which measure a width as the difference in veloc- 
ities between points on the spectrum itself at N% of the 
peak flux. Under the scheme adopted here, noise effects 
on either side of the profile are averaged out. Addition- 
ally, the use of the parameter f p — rms rather than simply 
f p substantially reduces the dependence of the measured 
width on the signal-to- noise ratio (SNR). The most im- 
portant flaws in the method are encountered when it is 
difficult to identify the location of the horns, such as 



in low signal-to-noise profiles as shown in panel 2(b), or 
when one of the horns is missing as in the strongly asym- 
metric profile shown in panel 2(c) (see, e.g. Richtcr & 
Sancisi 1994 and Haynes et al. 1998). In these cases, the 
width measurement loses objectivity and accuracy and 
its derived error may not reflect the true uncertainty. 
For this reason, we also assign a width "quality index" 
to each result listed in Table 3. 

To convert Wp 5a into a physical rotational velocity, 
four corrections must be applied: 1) an instrumental 
correction that accounts for the spectrometer resolution, 
smoothing applied to the spectrum, and SNR, 2) a cos- 
mological redshift correction, 3) a correction for turbu- 
lent motions of the HI gas, and 4) a correction for the 
inclination of the disk to the line of sight. The widths 
presented here account for the first two effects only. The 
redshift correction is simply the instrumental-corrected 
width divided by a factor of 1 +z, where z is the galaxy's 
redshift. The instrumental correction is described in the 
following subsection. Although we do not apply such a 
correction, the effect of turbulent motions on the mea- 
surement of Wf5o is discussed in Section 3.2.3. 

In addition to the width measurement algorithm de- 
scribed above, each spectrum has been measured using 
four additional algorithms: Wm50 is the width measured 
at 50% of the mean flux density, Wp5o is the width mea- 
sured at 50% of the value of the peak flux density minus 
the rms value f p — rms, \Vp20 is the width measured 
at 20% of the value of the peak flux density minus the 
rms value f p — rms, and W^pso is the width measured 
at 50% of each of the two peaks minus the rms value. 
We include these values to allow the interested reader to 
compare our width measurement results to those of other 
authors. However, we have derived corrections only for 
Wf5o, not for any of the additional width measurements. 

3.2.1. Smoothing 

It is common practice to convolve radio spectra ob- 
tained with autocorrelation spectrometers with a one- 
channel-wide cosine function (in practice a [0.25,0.5,0.25] 
three-channel function: Hanning). This is done for 
two reasons. First, narrowband terrestrial interference, 
which occurs often within the observed spectral range 
even within parts of the spectrum protected by regula- 
tory agreements for radio astronomical use, introduces 
"ringing" associated with the (sinx)/x functional re- 
sponse of the spectrometer to single-channel-wide fea- 
tures. And, second, because of the finite number of 
correlator channels, the true autocorrelation function is 
truncated, resulting in its apparent multiplication by a 
rectangular function which is a multiple of the autocor- 
relation lag time; this in turn means that the true power 
spectrum has been convolved with the Fourier trans- 
form of this rectangular function, itself a "sine function" . 
The application of Hanning smoothing reduces the spec- 
tral resolution of the observed spectrum. In some cases, 
further spectral smoothing is required by the low SNR 
of the observation or for other reasons; these measure- 
ments tend to be less useful for TF applications. All the 
HI spectra presented here are Hanning smoothed once 
(smoothing code H in our data tables, e.g. Haynes et 
al. 1997); in a minority of cases, as noted in our tabula- 
tions, three-channel "boxcar" averaging plus Hanning is 
applied instead (smoothing code B). 
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3.2.2. Instrumental and Noise Effects 

Previous width measurement algorithms have often re- 
lied on measuring the width by computing the velocity 
difference between channels on either side of the profile. 
The channels are identified by either starting from inside 
the profile and searching outwards or starting outside 
and searching inwards, flagging the first channel to cross 
a flux limit that represents a particular fraction of the 
peak flux. This procedure creates a systematic bias in 
the width measurement as a function of the SNR (Lewis 
1983). As described earlier, our width measurement pro- 
cedure minimizes this effect by instead calculating the 
velocity difference between polynomials that have been 
fit to several channels. Nevertheless, there remains some 
dependence on the SNR, defined here as the ratio of the 
peak HI signal to the rms noise in the line-free portion 
of the spectrum, which we discuss in this section. 

We have studied the instrumental and noise effect via 
simulations that incorporate autocorrelator spectrometer 
channel response and a variety of observed characteris- 
tics, such as profile slope and curvature. The simulations 
involve inputting a high resolution, noiseless profile, then 
binning the flux into channels, adding noise, and mea- 
suring the width of the profile using the same algorithm 
used also on the real data. This process is applied to a 
range of simulations adopting channel resolutions rang- 
ing from 5 to 11 km s _1 , for the two different smoothing 
prescriptions, and for a variety of different profile shapes. 

To account for the spectral noise, the true SNR of 
the spectrum can be characterized by SNR = (f p — 
rms) I rms. The broadening of the width measurement 
is denoted by the parameter A, which is one half the 
difference between the measured width and the width 
that would be measured in the absence of instrumen- 
tal and noise effects (measured in channels). We can 
then account for the combined instrumental and noise 
effects by subtracting an offset from the measured width, 
parametrized as 

As = 2Av A, (3) 

where Av is the spectrometer channel separation in 
km s _1 assumed to be equal to or very close to the 
channel width) and A depends on the SNR and the type 
of smoothing. 

The simulations suggest that A is influenced by two 
competing effects. As our width measurement algorithm 
measures a best-fit line to the sides of the profile at 50% 
of the value of f p — rms, the two parameters in play are 
1) the value of f p — rms, and 2) the range of spectral 
values over which the line is fit. 

If instrumental and noise effects increase (decrease) 
f p — rms, then A will decrease (increase), as a higher 
(lower) 50% value corresponds to a narrower (broader) 
width. Even in the absence of noise, the peak flux is 
depressed by channel binning and smoothing, broaden- 
ing the measured width. For realistic values of SNR, 
however, the situation is more complicated. One can 
characterize f p — rms with the expression 

f p — rms = Max(yi + Sirms) — rms, (4) 

where yi is the flux of channel i that one would measure 
in the absence of noise and Si is a gaussian random vari- 
able describing the noise contribution to channel i. We 



find that profiles with steep inner slopes observed with 
broad channel spacings tend to produce decreasing val- 
ues of f p — rms with increasing rms. Other profile shapes 
and spectral resolutions give a somewhat more compli- 
cated dependence. So the variation of f p — rms affects 
the profile width measurement, but whether it becomes 
broader or narrower depends on the details of the profile 
shape and resolution. 

A is also influenced by the range over which the line is 
fit to the sides of the profile. The endpoints of the fit are 
found by searching outward from each peak, identifying 
the first channels with flux less than 15% and 85% of 
f p — rms. This method introduces a slight bias towards 
the inclusion of channels whose flux has been depressed 
by noise. For noisier profiles, the region over which the 
line is fit also moves closer to the peak of the profile, 
where the slope is shallower. Both of these effects prefer- 
entially select channels with flux that falls below the best 
fit line in the noiseless case, leading to narrower width 
measurements. This effect dominates the dependence of 
A on SNR over the range 0.6 < log(SNR) < 1.1 for each of 
our channel resolutions and for all realistic profile shapes, 
meaning that A increases with SNR over that range. 

The simulations show that the most relevant line pro- 
file parameters in determining A are the slopes on each 
side of each of the two peaks and the number of chan- 
nels leading up to the peaks. While A typically varies by 
~ 0.5 channels for the same profile shape with different 
values of the SNR, the variation can be even greater (~ 
1 channel) among different profile shapes and among dif- 
ferent noise realizations of the same profile for constant 
SNR, channel width, and smoothing prescription. This 
variation limits our ability to determine A precisely for 
a real profile. However, we can simply apply our simu- 
lation results for a profile with mid-range values for the 
inner and outer slopes and number of channels leading up 
to the peaks. The resulting approximate solution gives: 

A = Ai(Au) for log(SNR) < 0.6 (5) 

A = A 2 (Aw)+A' 2 (Aw)log(SNR) for 0.6 < log(SNR) < 1.1 

(6) 

A = X 3 (Av) for log(SNR) > 1.1 (7) 

The values of \\(Av), X 2 (Av), X' 2 (Av), and A3 (Aw) are 
constants for both Av < 5 km s _1 and Av > 11 km s _1 , 
but depend linearly on Av within the range 5 < Av < 1 1 
km s _1 . The values of these parameters for each of the 
channel widths and smoothing prescriptions can be found 
in Table 2. We stress that both the linear dependence of 
A on log(SNR) in the range 0.6 < log(SNR) < 1.1 and 
the linear dependence of the A,'s on (Av) in the range 
5 < Av < 11 km s _1 are approximate and completely 
empirical. We also keep the Aj's constant outside the 
range 5 < Av < 11 km s _1 solely because few of our 
observations have channel resolutions far outside of that 
range, and we have done little testing of the width cor- 
rections outside that range. 

In addition to the instrumental and smoothing cor- 
rections, the observed width must also be corrected for 
rcdshift stretch, giving: 

W c = (VF F 5o-A s )/(l+z) = {W F50 -2Av \)/{1+z) (8) 
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TABLE 2 

Velocity Width Instrumental Correction Parameters 



Av 



smoothing 



X 2 (Av) 



y 2 (Av) 



\ 3 (Av) 



< 5 km s _1 
5-11 km s- 1 
>11 km s _1 

< 5 km s _1 
5-11 km s" 1 
>11 km s _1 



H 
H 
H 

B 
B 
B 



0.005 
0.037A« - 0.18 

0.227 

0.020 
0.052Ai> - 0.24 

0.332 



-0.4685 
0.0527Ad - 0.732 

-0.1523 

-0.4705 
0.0397Ad - 0.669 

-0.2323 



0.92 



0.785 
-0.027Ad 

0.623 

0.820 
0.020At> + 0.72 

0.940 



0.395 
0.023A*; + 0.28 

0.533 

0.430 
0.062Ad - 0.12 

0.802 



3.2.3. Turbulent Motion Correction 

The HI clouds in a disk do not move along strictly cir- 
cular orbits. Even in the absence of large-scale anoma- 
lies, such as warps, bars and other global asymmetries, 
deviations from circular motion occur, such as those asso- 
ciated with small-scale dispersion and streaming motions 
near spiral arms. Bottinelli et al. (1983) have suggested 
that the contribution to the observed width of turbulent 
motions, or otherwise small-scale deviations from circu- 
lar rotation, could be expressed via a term Wt, additive 
to the circular width in the form: W = 2V max sini + W t 
where i is the disk inclination and V ma x the maximum 
rotational width. Tully & Fouque (1985) found that, 
while a linear correction to the observed width may 
well account for the effect of turbulent motions on large 
widths, for low luminosity galaxies a quadratic combi- 
nation would be more appropriate (see also Fouque et 
al. 1990a). They thus proposed a formula, their Equa- 
tion 12, that yields a linear combination at large widths, 
a quadratic combination at small widths, and a transi- 
tional form at intermediate widths. This formula was 
adopted by a variety of other authors (e.g., Broeils 1992; 
Rhee 1996; Verheijen 1997) who also measure HI widths 
at N% of the profile peak. These authors measure the 
value of the Wt parameter by minimizing the scatter be- 
tween the turbulence-corrected velocity widths and the 
velocity widths determined from rotation curves of the 
same galaxies. Verheijen (1997) reviews the results of 
the aforementioned authors, and notes that the optimal 
value of W t strongly depends on the instrumental correc- 
tion used. 

Since we do not measure velocity widths at N% of the 
profile peak, and our instrumental correction differs from 
that of these other authors, we are not justified in adopt- 
ing any of the previous values of W t . Instead, we de- 
rive our own turbulent motion correction by introducing 
a random, isotropic a — 10 km s _1 turbulent motion 
into the simulations described in the previous subsec- 
tion. We find that this depresses the peak flux, and thus 
broadens the measured profile width by moving the 50% 
value farther from the profile center. As in Verheijen 
(1997) and references therein, the amount of turbulence- 
induced width broadening is found to vary greatly from 
galaxy to galaxy, depending on the details of the profile 
shape. We do not find a discernable trend in the amount 
of width broadening with profile width or signal-to-noise 
ratio. For the sake of simplicity, we thus subtract the 
turbulence correction from the width linearly, with the 
equation: W c . t = W c + W t , where W t — 6.5 km s _1 . 
Lavezzi & Dickey (1997) likewise discuss the merits of a 



simple linear subtraction of a turbulence term. 

As stated earlier, in order to use W c for applications 
such as those involving the Tully-Fisher relation, cor- 
rections for both inclination angle and turbulence must 
be applied as well. Because of its uncertainty, the val- 
ues of corrected width W c reported here do not include 
that correction, nor is viewing angle considered. Authors 
wishing to make full use of the data should apply those 
corrections themselves. 

4. DATA COMPILATION 

All of the spectra in the archive have been newly re- 
processed using the new flux and width corrections. The 
spectra themselves will be made available through the 
U.S. National Virtual Observatory. Here, we provide 
the corrected and uncorrected spectral parameters. In 
Table 3, we present the results for all 8852 spectra for 
which we are confident that we have detections. The for- 
mat, described below, closely resembles that of Haynes 
et al. (1999) Table 1: 

Column (1). Entry number in the UGC, where appli- 
cable, or else in our private database, referred to as the 
Arecibo General Catalog (AGC). 

Column (2). NGC or IC designation, or other name, 
typically from the Catalog of Galaxies and Clusters of 
Galaxies (Zwicky et al. 1961-1968), ESO, or the Mor- 
phological Catalog of Galaxies (Vorontsov-Velyaminov & 
Arhipova 1968). Where used, the designation in the lat- 
ter is abbreviated to eight characters. 

Columns (3) and (4). Right Ascension and Declina- 
tion in the J2000.0 epoch either from the NASA/IPAC 
Extragalactic Database (NED) 6 or measured by us 
on the POSS-I. Typically, the listed positions have < 
5"accuracy. 

Column (5). The blue major and minor diameters, 
axb, in arcminutes, either from the UGC or ESO catalogs 
or as estimated by us on the POSS-I. 

Column (6). The morphological type code following 
the RC3 system. Classification comes either from the 
UGC or ESO catalogs or from our own visual examina- 
tion of the POSS-I prints. 

Column (7). The observed integrated 21 cm HI line 
flux S bs = J S dV in Jy km s _1 . 

Column (8). The corrected integrated 21 cm HI line 
flux S c , also in Jy km s , after corrections applied for 
pointing offsets (Arecibo line/flat feed spectra only) and 

6 The NASA/IPAC Extragalactic Database is operated by the 
Jet Propulsion Laboratory, California Institute of Technology, un- 
der contract with the National Aeronautics and Space Administra- 
tion. 
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source extent following Section 3.1. 

Column (9). The self-absorption corrected integrated 
21 cm HI line flux S£ , also in Jy km s _1 , after correc- 
tions applied for pointing offsets (Arecibo line/flat feed 
spectra only), source extent, and HI self- absorption, fol- 
lowing Section 3.1. 

Column (10). The uncertainty es in the self-absorption 
corrected integrated flux S^ bs , following Section 3.1.1. 

Column (11). The rms noise per channel of the spec- 
trum, rms, in mjy. 

Column (12). The emission profile signal-to-noise ra- 
tio, SNR, taken as the ratio of the peak HI line flux to 
the rms noise within the signal-free portion of the spec- 
trum. Unlike in Section 3.2.2, we do not subtract the 
rms from the peak flux before dividing by the rms. 

Column (13). The heliocentric velocity V@, in km s _1 , 
of the HI line signal, taken as the midpoint of the profile 
at the 50% level also used to measure the width (see 
below) . 

Column (14-18). The full velocity width of the HI line 
in km s _1 , uncorrected for rcdshift or other effects, us- 
ing the five measurement algorithms discussed in Section 
2.2. Widths are W F5 o,Wm50,Wp 5 o,Wp 2 o, and W 2 p 5 o 
respectively. 

Column (19). A corrected velocity width W c , in 
km s _1 , which accounts for redshift stretch, instrumen- 
tal effects, and smoothing. The correction is applied to 
Wf5o- Note that this width is not rectified for viewing 
angle or turbulence. 

Column (20). The estimated error on W c , e w , in 
km s _1 , taken to be the sum in quadrature of the mea- 
surement error and the estimated error in broadening 
corrections. The estimated error on the velocity itself, 
ey, can be approximated as e^/2 1 / 2 . 

Column (21). A code indicating the instrument used: 
(AOlf) Arecibo, line feed system; (AOff) Arecibo, flat 
feed system; (AOG) Arecibo Gregorian; (Effs) Effclsberg 
100 m; (GB300) Green Bank 91 m; (GB140) Green Bank 
42 m; (Nanc) Nancay. 

Column (22). Bandwidth of the spectrum in kHz A/. 

Column (23). Number of channels in the spectrum 

Nchan • 

Column (24). Three codes indicating the smoothing, 
width quality, and inclusion in the 2001 Arecibo obser- 
vations. 

The first code is the smoothing code: (H) single Han- 
ning only; (B) Hanning plus three-channel boxcar. 

The second is a qualitative assessment of the quality of 
the profile: (G) good; (F) fair; (S) single peak; (P) poor 
quality for TF applications; (C) confused; (A) shows ab- 
sorption. Velocity widths measured from profiles classi- 
fied as good detections should be useful for TF applica- 
tions; those measured on profiles denoted "F" should be 
used with caution. Because the width measuring algo- 
rithm is designed for application to two-horned profiles, 
the widths measured on single peaked profiles should also 
be used with caution, as they may underestimate the true 
rotation width. Widths in the last two categories cannot 
be used for TF purposes. Confusion is identified only 
when contamination from another galaxy in the beam is 
certain. 

The third code where applicable indicates, with a '*', 
inclusion in the sample observed with the Arecibo Tele- 



scope in the fall of 2001 as described in Section 2.2. 

In addition to the HI detections presented in Table 3, 
Tables 4 and 5 also summarize useful data about objects 
for which homogeneous upper limits to the HI flux can be 
calculated. Table 4 contains summary data for 156 galax- 
ies classified as "marginal" detections, that is, for those 
cases in which the signal has not been verified through 
adequate reobservation. Table columns match those of 
Table 3, except that we omit Columns 14-20 because of 
their uncertainty, and Column 24 because, in this in- 
stance, all spectra have been both Hanning and Boxcar 
smoothed. Table 4 also adds Column 25, the heliocentric 
velocity as derived from optical observations, VQ- op t, in 
km s _1 , taken from the NED. Although the parameters 
extracted from the HI line profile are uncertain, the co- 
incidence of the HI signal with the expected location in 
frequency corresponding to the optical rcdshift brings a 
measure of believability to the possible HI signal. Such 
objects may be deserving of further, deeper HI observa- 
tions which probe sensitivities significantly greater than 
those presented here. 

In Table 5, we present summary results for objects 
which are classified as the HI line nondctections, except 
that Columns 7, 8, 9, 10, 12, and 13 are also omitted. 
Column 26 adds the heliocentric velocity V cen t corre- 
sponding to the central frequency of the band searched 
for HI emission. Only objects for which the radio fre- 
quency band covered by the spectrum included the fre- 
quency expected from the optical rcdshift are listed in 
this table. Upper limits to the HI line flux can be esti- 
mated from the rms noise per channel and an assumption 
of the HI profile shape. A simple estimate of the upper 
limit to the integrated HI line flux, useful for calculat- 
ing an upper limit to the HI mass for non-detections, 
adopts a rectangular profile of height 1.5 rms SW, where 
SW is the full width of the Doppler broadened HI line, 
expected, for example, for a galaxy of a given optical 
luminosity given by the Tully-Fisher relation. 

5. DATASET CHARACTERISTICS AND LIMITATIONS 

In contrast to a wide area, HI blind survey like 
HIPASS, the archival dataset presented here is extremely 
heterogenous in terms of its sky coverage, depth and com- 
pleteness. At the same time, it contains HI line spectra 
for more than twice as many detections as the HIPASS 
HICAT (Meyer et al. 2004), with much higher typical 
sensitivity and probing a greater volume. 

The sky distribution for all detected galaxies listed in 
Table 3 is shown in Figure 3. The unevenness of sky 
coverage, with strong preference for that portion of the 
sky visible to the Arecibo telescope, is clearly evident. In 
fact, 77 % of the objects fall within the Arecibo declina- 
tion limits (-2° < Dec. < +38°). The southern cutoff at 
Dec. ~ —40° represents the limitations of the steerable 
northern hemisphere telescopes that were used. There is 
also a strong deficiency of objects in the Zone of Avoid- 
ance, not just because the archive is comprised of objects 
with known optical counterparts but also because sev- 
eral of the previous surveys have avoided regions of high 
galactic extinction for reasons of photometric accuracy 
and completeness at optical wavelengths. 

The sky distribution of targeted galaxies also reflects 
the large scale structure within the survey volume, ev- 
ident both in Figure 3 and in the redshift distribution 
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Fig. 3. — Aitoff equal area projection of the sky distribution 
of galaxies in the HI archive. Shown here are all galaxies with 
reliable HI fluxes and widths (all those listed in Table 3). The 
plot is centered at R.A. = 6 h . The thick lines trace the galactic 
latitudes b = -20°, b = 0°, and b = +20°. 
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Fig. 4. — Distribution of heliocentric radial velocities for all 
galaxies in the archive, in bins of width 200 km s _1 . Observations 
conducted at Arecibo are indicated by the shaded histogram. The 
impact of local large scale structures and the preferential coverage 
corresponding to that part of the celestial sphere visible to the 
Arecibo telescope, is evident. 



illustrated in Figure 4. The most prominent feature is 
the Pisces-Perseus Supercluster at 22 h < R.A. < 4^ and 
cz ~ 5000 km s _1 which has been the target of several 
past observing programs. Also seen clearly is the Virgo 
region toward the center of the Local Supercluster at 
R.A. ~ I2 h , cz ~ 1500 km s" 1 . 

The distributions of the rms noise per channel and SNR 
are shown in Figures 5 and 6 respectively. Because of the 
superior gain and angular resolution of the Arecibo tele- 
scope, observations undertaken with it are typically more 
sensitive than ones conducted with other instruments. 
For this reason, the fraction of the archival dataset ex- 
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Fig. 5. — Distribution of rms noise values per channel for all 
galaxies in the archive, in bins of width 0.1 mJy. Observations 
conducted at Arecibo are indicated by the shaded histogram. 
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Fig. 6. — Distribution of peak— signal— to-rms noise ratios, SNR, 
for all galaxies in the archive, in bins of width 0.05 dex. Observa- 
tions conducted at Arecibo are indicated by the shaded histogram. 



tracted from Arecibo observations is indicated separately 
in those figures. The diversity of the observational pro- 
grams which have contributed datasets to this HI archive 
is evident in these distributions. In some instances, the 
targetting of objects for the explicit purpose of measur- 
ing comparative HI content or accurate HI line widths 
has placed more stringent demands for lower rms noise 
values, or higher SNR, than, for example, those programs 
that were conducted principally as wide area redshift sur- 
veys. 

The total integrated (corrected) and peak HI line flux 
distributions are shown in Figure 7 and 8 respectively. 
Comparing these figures to the corresponding distribu- 
tions for the HIPASS/HICAT blind HI survey shown in 
Figure 9 of Meyer et al. (2004) and Figure 4 of Zwaan 
et al. (2004), it is readily apparent that the optically- 
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Fig. 7. — Distribution of corrected HI fluxes S^ bs for all galaxies 
in the archive, in bins of width 0.1 dcx. Observations conducted 
at Arecibo are indicated by the shaded histogram. The solid line 
in the bottom plot traces the case where log N ga i axiea oc —1.5 
log S° bs , which is the dependence expected of an HI flux limited 
sample. . 



targeted archive presented here contains a substantial 
number of galaxies with both integrated fluxes and peak 
fluxes well below those included in the HIPASS cata- 
log. The HICAT sample contains very few objects with 
Shi < 2.5 Jy km s -1 , whereas almost one third of the 
optically selected sample presented here have fluxes S% 
lower than that value. Likewise, HICAT strives to be 
peak flux limited and contains almost no sources with 
peak HI fluxes below 30 mJy whereas the majority of 
galaxies in the optically-selected sample have peak fluxes 
below that value. 

For an HI flux limited sample, the logarithm of the 
number of galaxies per unit log S^ bs should be propor- 
tional to —1.5 log S^ bs ; this relationship should also hold 
for the peak flux in a peak flux limited sample. The bot- 
tom panels of Figures 7 and 8 show the log-log distribu- 
tions of corrected flux S^ bs and peak HI line flux S pea k, 
with the N oc —1.5 log S line superposed for reference. 
Clearly the optically-selected sample is neither peak HI 
flux nor total HI flux limited as the Hl-blind surveys 
strive to be. However, as shown in Springob, Haynes, & 
Giovanelli (2005), with the addition of supplemental data 
from the literature, it is possible to extract an optical di- 
ameter limited and HI flux limited subsample, restricted 
to the portion of the sky lying within the Arecibo decli- 
nation limits. 

Figure 9 shows the distribution of instrument ally- 
corrected (but not turbulence- or inclination-corrected) 
widths W c for all galaxies in the archive. Objects with 
confused, asymmetric or otherwise oddly behaved spec- 
tral profiles (coded "C" , "P" or "F" in Table 3) are in- 
cluded in the full distribution, but profiles which are clas- 
sified as "G" or "S" are included in the shaded area. In 
comparison to the HIPASS/HICAT dataset, the galax- 
ies included in this optically-selected sample are pref- 
erentially faster rotators, unsuprising given the Tully- 
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Fig. 8. — Distribution of peak HI fluxes S pea k for all galaxies in 
the archive, in bins of width 0.1 dex. Observations conducted at 
Arecibo arc indicated by the shaded histogram. The solid line in 
the bottom plot traces the case of log N ga i axies oc —1.5 log S pea k, 
which is the dependence expected of a peak flux limited sample. 
The heterogenous nature of the observations that contribute to this 
sample is very evident by the lack of a cutoff minimum S pea k- 
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Fig. 9. — Distribution of corrected HI widths W c for all galaxies 
in the archive except those identified as confused, in bins of width 
0.05 dex. Observations conducted at Arecibo are indicated by the 
shaded histogram. 



Fisher relation and the much larger volume sampled here. 
This dataset is being used to derive the observationally- 
based rotational velocity function for HI disk systems 
(Springob, Haynes & Giovanelli, in preparation). 

There is one interesting issue for which much of this 
dataset is unfortunately largely unsuitcd: the study of 
global asymmetry in HI profiles (Richter & Sancisi 1994; 
Haynes et al. 1998). As discussed in Section 3.1, the 
shapes of global HI profiles observed with single dish 
telescopes are affected by both beam attenuation and 
pointing offsets and errors. Prior to its recent upgrade, 
the Arecibo telescope L-band feeds were subject to point- 
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ing errors of typically 20-30" but sometimes larger, es- 
pecially when the observations were conducted at large 
zenith angle. We therefore caution against using the 
spectra contained in the AGC digital archive for stud- 
ies of asymmetry, with the exception of the very carefully 
collected Green Bank 42 m telescope dataset obtained for 
that very purpose as discussed in Haynes et al. (1998). 

6. SUMMARY 

We present here a compilation of HI spectral param- 
eters for more than 9000 optically-targeted galaxies in 
the local universe which have been observed by us and 
our collaborators over the last 20+ years. All of the HI 
spectra have been reprocessed in a homogenous manner 
and a single set of algorithms has been used to extract 
spectral parameters which can then be corrected to phys- 
ical parameters taking into account the instrument- and 
processing-related complications such as spectral resolu- 
tion, source extent, pointing offsets and signal-to-noise. 
We use both empirical evidence and the results of simu- 
lated observations to produce recipes for deriving these 
physical parameters from the observed ones based on 
all of the information available from the digital spectra 
themselves. 

The recent blind HI surveys such as HIPASS offer the 
advantage of completeness and homogenous data pro- 
cessing and parameter extraction. By reprocessing all 
of our existing HI line spectra using software specifi- 
cally designed to exploit the details of the observing se- 
tups and the characteristics of the spectra themselves, 
we have tried to minimize the effects of instrument- and 
processing-related differences among the datasets. At 
the same time, HI completeness issues remain in optically 
targeted samples, despite their greater depth and sensi- 
tivity. It is well known that optical galaxy catalogs often 
preferentially miss the low optical surface brightness but 
Hi-rich late type galaxies that are common in the lo- 
cal universe. In contrast, blind HI surveys do not suffer 
from this strong bias (Rosenberg & Schneider 2002; Ko- 
ribalski et al. 2004). A more thorough discussion of the 
completeness of the AGC HI compilation in the context 
of the derivation of the HI mass function is presented in 
Springob, Haynes & Giovanelli (2005). Despite the lack 
of flux completeness, the redshift, emission line flux, and 
observed rotational widths available from the global HI 
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line profiles collected here provide important measures 
of fundamental galaxy properties useful to diverse stud- 
ies of galaxies in the nearby universe, especially when 
combined with complementary multiwavelength data. 

In addition to the tabulations contained herein, the ac- 
tual smoothed and baseline subtracted spectra are avail- 
able in digital form by contacting the authors. Plans are 
underway to make the entire dataset availble through the 
US National Virtual Observatory in conjunction with a 
wide area blind HI survey to be undertaken at Arecibo 
beginning in 2005. This Arecibo Legacy Fast ALFA (AL- 
FALFA) survey will make use of the new 7-feed dual 
polarization, L-band array (ALFA) to survey over 7000 
deg 2 of the sky visible to Arecibo, detecting tens of thou- 
sands of HI sources, from local, very low HI mass dwarfs 
to gas-rich massive galaxies seen to z ~ 0.06. A care- 
ful study of the detection statistics of the present opti- 
cally selected sample in such a sensitive HI blind survey 
will provide insight into the differential characteristics of 
galaxies according to optical surface brightness and HI 
content. Only the mining of datasets derived from wide 
area surveys conducted at multiple wavelengths will, in 
combination, provide the true census of all galaxies that 
occupy the local universe. 
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